A POTENTIAL EFFECTOR FOR THE GRB7 FAMILY OF SIGNALLING PROTEINS

Field of the Invention:

The present invention relates to a novel polynucleotide molecule
encoding a candidate effector protein for the Grb7 family of signalling
proteins. Detection of the encoded protein in a tissue sample should provide
a useful tumour marker and/or prognostic indicator. Furthermore,
antagonism of the interaction between Grb7 family members and the
encoded protein should provide a novel treatment strategy for human
diseases exhibiting aberrant receptor tyrosine kinase (RTK) signalling (e.g.
cancer).

Background of the Invention

RTKs play a major role in the regulation of cellular growth,
differentiation, motility and metabolism by converting an extracellular signal
in the form of the binding of a specific hormone or growth factor to the
activation of specific signalling pathways and hence modes of intracellular
communication (Schlessinger and Ullrich, Neuron 9, 383-391, 1992).
Activation of RTKs results in both autophosphorylation of the receptor and
the phosphorylation of downstream targets on tyrosine residues. It has
become evident over the last decade that key elements in receptor-substrate
and other protein-protein interactions in RTK signalling are src homology
(SH)2 domains. SH2 domains are conserved modules of approximately 100
amino acids found in a wide variety of signalling molecules which bind to
short tyrosine-phosphorylated peptide sequences. The specificity of
interaction is determined both by the nature of the amino acids flanking the
phosphotyrosine residue in the target peptide and residues in the SH2
domain which interact with these sites (Pawson, Nature 373, 573-580, 1995).

SH2-domain containing proteins can be divided into two classes: those
which possess a catalytic function (e.g. the cytoplasmic tyrosine kinase c-src
and the tyrosine phosphatase SH-PTP2) and those which consist entirely of
non-catalytic protein domains (e.g. Grb2), the adaptor sub-class. The function
of the latter class is to link separate catalytic subunits to a
tyrosine-phosphorylated receptor or signalling intermediate, and other
non-catalytic protein modules are often involved in these interactions. For
example, SH3 and WW domains (conserved regions of approximately 50 and 40
amino acids, respectively) bind proline-rich peptide ligands, and pleckstrin
homology domains (approximately 100 amino acids) interact with both
specific phospholipid and protein targets (Pawson, 1995 supra).

The Grb7 family represents a family of SH2 domain-containing
adaptors which currently contains three members: Grb7, 10 and 14 (Margolis
et al, Proc. Natl. Acad. Sci. USA 89, 8894-8898, 1992; Stein et al, EMBO J. 13,
1331-1340, 1994; Ooi et al, Oncogene 10, 1621-1630, 1995; Daly et al, J. Biol.
Chem. 271, 12502-12510, 1996). These proteins share a common overall
architecture, consisting of an N-terminal region containing a highly
conserved proline-rich decapeptide motif, a central region harbouring a PH
domain and a C-terminal SH2 domain. The central region of approximately
300 amino acids bears significant homology to the C. elegans protein mig10,
which is required for long range neuronal migration in embryos; otherwise
the Grb7 family and mig10 are structurally distinct. However, the Grb7 family
members exhibit differences in both SH2 selectivity towards RTKs (Janes et al,
J. Biol. Chem. 272, 8490-8497, 1997) and tissue distribution. The family has
therefore evolved to link particular receptors to downstream effectors in a
tissue-specific manner. Interestingly, the genes encoding this family appear
to have co-segregated with ERBB family genes during evolution. Thus GRB7, 10
and 14 are linked to ERBB2, ERBB1 (epidermal growth factor receptor) and
ERBB4, respectively (Stein et al, 1994 supra; Ooi et al, 1995 supra; Baker
et al, Genomics 36, 218-220, 1996). The juxtaposition of GRB7 and ERBB2 leads
to their common co-amplification in human breast cancers and, since the two
gene products are functionally linked, to likely up-regulation of an
undefined erbB2 signalling pathway. Furthermore, GRB14 also exhibits
differential expression in human breast cancers (Daly et al, 1996 supra).
These two proteins may therefore modulate RTK signalling in this disease.

In order to identify proteins which bind to this family and therefore
identify candidate effectors, we performed a genetic screen using the yeast
two hybrid system and Grb14 "bait". This application describes the cloning
and characterization of a novel interacting protein, currently designated
2.2412.

Disclosure of the Invention:

Thus, in a first aspect, the present invention provides an isolated
polynucleotide molecule encoding a candidate effector protein for the Grb7
family of signalling proteins, wherein the polynucleotide molecule comprises
a nucleotide sequence having at least 75% sequence identity to that shown as
SEQ ID NO: 1.

Preferably, the polynucleotide molecule comprises a nucleotide
sequence having at least 85%, more preferably at least 95%, sequence
identity to that shown as SEQ ID NO: 1. Most preferably, the polynucleotide
molecule comprises a nucleotide sequence encoding a polypeptide
comprising an amino acid sequence substantially corresponding to that
shown as SEQ ID NO: 2.
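
The identity thresholds above (75%, 85% and 95%) can be illustrated with a minimal sketch. The specification does not state how sequence identity is to be calculated, so the following assumes the two sequences have already been aligned to the same length (gaps written as "-") and simply counts matching columns. The function names `percent_identity` and `identity_tier` are hypothetical, chosen for illustration only.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent identity between two pre-aligned, equal-length sequences.

    Assumption: alignment was done beforehand; '-' marks a gap.
    Columns in which both sequences carry a gap are ignored.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    columns = [
        (a, b)
        for a, b in zip(seq_a.upper(), seq_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no comparable columns")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def identity_tier(pct: float) -> str:
    """Map a percent identity onto the tiers named in the first aspect."""
    if pct >= 95.0:
        return "at least 95%"
    if pct >= 85.0:
        return "at least 85%"
    if pct >= 75.0:
        return "at least 75%"
    return "below 75%"


# First 20 nucleotides of SEQ ID NO: 1, and a copy with one substitution.
reference = "ATTCCTCTTCATAATGCATG"
variant = "ATTCCTCTTCATAATGCATC"
pct = percent_identity(reference, variant)
tier = identity_tier(pct)
```

In practice the figure depends on the alignment method and on how gaps are scored, neither of which is fixed by the text.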

In a preferred embodiment of the invention of the first aspect, the
polynucleotide molecule comprises a nucleotide sequence which
substantially corresponds to that shown as SEQ ID NO: 1.

The polynucleotide molecule may be a dominant negative mutant
which encodes a gene product causing an altered phenotype by, for example,
reducing or eliminating the activity of endogenous effector proteins of the
Grb7 family of signalling proteins.

The polynucleotide molecule may be incorporated into plasmids or
expression vectors (including viral vectors), which may then be introduced
into suitable host cells such as bacterial, yeast, insect and mammalian host
cells. Such host cells may be used to express the protein encoded by the
polynucleotide molecule.

Accordingly, in a second aspect, the present invention provides a host 
cell transformed with the polynucleotide molecule of the first aspect. 

In a third aspect, the present invention provides a method of producing
a protein, comprising culturing the host cell of the second aspect under
conditions suitable for the expression of the polynucleotide molecule and
optionally recovering the protein.

Preferably, the host cell is mammalian or of insect origin. Where the
cell is mammalian, it is presently preferred that it be a Chinese hamster
ovary (CHO) cell or human embryonic kidney (HEK) 293 cell. Where the
host cell is of insect origin, it is presently preferred that it be an insect Sf9
cell.

In a fourth aspect the present invention provides a purified protein
encoded by the polynucleotide molecule of the first aspect.

In a preferred embodiment of this aspect, the purified protein
comprises an amino acid sequence substantially corresponding to that shown
as SEQ ID NO: 2.

In a fifth aspect, the present invention provides a fusion protein
comprising an amino acid sequence substantially corresponding to that
shown as SEQ ID NO: 2.

Fusion proteins according to the fifth aspect may include an
N-terminal fragment of a protein such as β-galactosidase to assist in the
expression and selection of host cells expressing candidate effector protein,
or may include a functional fragment of any other suitable protein to confer
additional activity(ies).

In a sixth aspect, the present invention provides an antibody or
fragment thereof which specifically binds to the protein of the fourth aspect.

The antibody may be monoclonal or polyclonal; however, it is
presently preferred that the antibody is a monoclonal antibody. Suitable
antibody fragments include Fab, F(ab')2 and scFv.

In a seventh aspect, the present invention provides an oligonucleotide
probe comprising a nucleotide sequence of at least 12 nucleotides, the
oligonucleotide probe comprising a nucleotide sequence such that the
oligonucleotide probe selectively hybridises to the polynucleotide molecule
of the first aspect under high stringency conditions (Sambrook et al,
Molecular Cloning: a Laboratory Manual, Second Edition, Cold Spring Harbor
Laboratory Press).

In a preferred embodiment of this aspect, the oligonucleotide probe is 
labelled. In a further preferred embodiment of this aspect, the 
oligonucleotide probe comprises a nucleotide sequence of at least 18 
nucleotides. 
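
As a rough illustration of the seventh aspect, the sketch below derives an 18-nucleotide antisense probe from the first 30 nucleotides of SEQ ID NO: 1. Taking the reverse complement is one simple way to obtain a probe that base-pairs with the sense strand or its mRNA. The helper names are hypothetical, and nothing here models selectivity or high stringency conditions, which depend on the hybridisation protocol.

```python
# Complement table for lower- and upper-case DNA letters.
COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")

MIN_PROBE_LENGTH = 12  # lower bound stated in the seventh aspect


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def antisense_probe(target: str, start: int, length: int = 18) -> str:
    """Antisense probe against target[start:start + length] (0-based start)."""
    if length < MIN_PROBE_LENGTH:
        raise ValueError("probe must be at least 12 nucleotides long")
    window = target[start:start + length]
    if len(window) < length:
        raise ValueError("target too short for the requested probe")
    return reverse_complement(window)


# First 30 nucleotides of SEQ ID NO: 1 (sense strand).
TARGET = "attcctcttcataatgcatgctcttttggt"
probe = antisense_probe(TARGET, 0, 18)
```

A labelled probe of this kind would be used in the hybridisation assay of the ninth aspect; probe position and length are illustrative choices, not values taken from the specification.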

In an eighth aspect, the present invention provides a method of
detecting in a sample the presence of an effector protein for the Grb7 family
of proteins, the method comprising reacting the sample with an antibody or
fragment thereof of the sixth aspect, and detecting the binding of the antibody
or fragment thereof.

The method of the eighth aspect may be conducted using any
immunoassays well known in the art (e.g. ELISA). The sample may be, for
example, a cell lysate or homogenate prepared from a tissue biopsy.

In a ninth aspect, the present invention provides a method of detecting 
in a sample the presence of mRNA encoding an effector protein for the Grb7 
family of proteins, the method comprising reacting the sample with an 
oligonucleotide probe of the seventh aspect, and detecting the binding of the 
probe. 

The method of the ninth aspect may be conducted using any 
hybridisation assays well known in the art (e.g. Northern blot). The sample 
may be a poly(A) RNA preparation or homogenate prepared from a tissue 
biopsy. 

Grb7 family proteins exhibit differential expression in certain human 
cancers (particularly breast and prostate cancer) and may therefore be 
involved in tumour progression. Detection of the protein encoded by the 
cDNA 2.2412 in a sample should provide a useful tumour marker and/or 
prognostic indicator for these cancers. Furthermore, the interaction of Grb7 
family members with 2.2412 may provide a novel target for therapeutic 
intervention. 

It is to be understood that methods of detecting suitable agonists and 
methods of therapy utilising detected agonists also form part of the present 
invention. 

The term "substantially corresponds" as used herein in relation to the 
nucleotide sequence shown as SEQ ID NO: 1 is intended to encompass minor 
variations in the nucleotide sequence which due to degeneracy in the DNA 
code do not result in a change in the encoded protein. Further, this term is 
intended to encompass other minor variations in the sequence which may be 
required to enhance expression in a particular system but in which the 
variations do not result in a decrease in biological activity of the encoded 
protein. 
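
The point about degeneracy can be made concrete with a short sketch. It translates the first seven codons of SEQ ID NO: 1 and a synonymous variant using the standard genetic code, and shows that both encode the same peptide (Ile Pro Leu His Asn Ala Cys, the start of SEQ ID NO: 2). The variant sequence is invented for illustration and does not appear in the specification.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence (reading frame 1) to one-letter code."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


original = "attcctcttcataatgcatgc"   # nucleotides 1-21 of SEQ ID NO: 1
synonymous = "atcccactgcacaacgcgtgt"  # hypothetical variant, third bases changed

peptide_original = translate(original)
peptide_variant = translate(synonymous)
differences = sum(1 for a, b in zip(original, synonymous) if a != b)
```

Here seven of the 21 nucleotides differ, yet the encoded peptide is unchanged, which is the kind of variation the term "substantially corresponds" is said to cover.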

The term "substantially corresponding" as used herein in relation to the
amino acid sequences shown as SEQ ID NO: 2 is intended to encompass
minor variations in the amino acid sequences which do not result in a
decrease in biological activity of the protein. These variations may include
conservative amino acid substitutions. The substitutions envisaged are:-

G, A, V, I, L, M; D, E; N, Q; S, T; K, R, H; F, Y, W, H; and
P, Nα-alkalamino acids.
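
The substitution groups listed above can be expressed as a small lookup, sketched below. It assumes the first group reads G, A, V, I, L, M, and it leaves out the final group (P and the N-alpha-substituted amino acids) because those residues have no standard one-letter codes apart from P. The function name `is_conservative` is hypothetical.

```python
# Conservative substitution groups as listed in the specification
# (one-letter amino acid codes). H appears in two groups, as in the text.
GROUPS = [
    set("GAVILM"),
    set("DE"),
    set("NQ"),
    set("ST"),
    set("KRH"),
    set("FYWH"),
]


def is_conservative(original: str, replacement: str) -> bool:
    """True if the two different residues share at least one listed group."""
    a, b = original.upper(), replacement.upper()
    if a == b:
        return False  # identical residues are not a substitution
    return any(a in group and b in group for group in GROUPS)


examples = {
    ("K", "R"): is_conservative("K", "R"),
    ("H", "F"): is_conservative("H", "F"),
    ("D", "K"): is_conservative("D", "K"),
}
```

Whether a given substitution actually preserves biological activity is an experimental question; the lookup only reports membership of the listed groups.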

The terms "comprise", "comprises" and "comprising" as used
throughout the specification are intended to refer to the inclusion of a stated
step, component or feature or group of steps, components or features with or
without the inclusion of a further step, component or feature or group of
steps, components or features.

The invention will hereinafter be described with reference to the 
accompanying figure and the following, non-limiting example. 

Brief description of the accompanying figure:

Figure 1 provides the nucleotide and amino acid (single letter code)
sequence of 2.2412. Numbers refer to distances in base pairs. Ankyrin-type repeat
sequences are underlined. An additional repeat sequence is indicated by italics.
The stop codon is represented by an asterisk. The original cDNA clone 2.2412
isolated by the two hybrid screen spans nucleotides 694-2664 of this sequence.

Figure 2 provides a map of the 2.2412-binding region on Grb14.

A. Structure of the deletion constructs used in the analysis. Gal4 DNA-BD fusion
constructs encoding full length Grb14 (FL), the N-terminal (N), central region (C)
and N-terminal + central region (N + C) were generated in the vector pAS2.1.

B. Results of β-galactosidase activity assays following transformation of the above
plasmids into yeast strain Y190 together with the original 2.2412 cDNA clone in
pACT-2.

Example: CLONING AND CHARACTERISATION OF 2.2412

Yeast two hybrid screen

The yeast two hybrid system exploits protein-protein interactions to
reconstitute a functional transcriptional activator which can then be detected
using a gene reporter system (Fields and Sternglanz, TIG 10, 286-292, 1994).
The technique takes advantage of the properties of the Gal4 protein of the
yeast S. cerevisiae. The Gal4 DNA binding domain (DNA-BD) or activation
domain (AD) alone are incapable of inducing transcription. However, an
interaction between two proteins synthesized as DNA-BD- and AD-fusions,
respectively, brings the Gal4 domains into close proximity and results in
transcriptional activation of two reporter genes (HIS3 and LacZ) which can be
monitored by growth on selective medium and biochemical assays.

A plasmid construct encoding a Gal4 DNA-BD-Grb14 fusion was
generated as follows. The plasmid GRB14/pRcCMVF containing full length
GRB14 cDNA (Daly et al, 1996) was restricted with HindIII and Klenow
treated to create blunt ends, and then digested with BclI to release three
fragments of approximately 1.1, 4.2 and 1.7 kb. The 1.7 kb fragment was
isolated and cloned into the NdeI (Klenow treated) and BamHI sites of the
yeast expression vector pAS2.1 (Clontech) to generate GRB14/pAS2.1
containing an in-frame fusion of full length Grb14 with the GAL4 DNA-BD.
This construct was introduced by electroporation into the yeast strain
CG1945 [MATa, ura3-52, his3-200, ade2-101, lys2-801, trp1-901, leu2-3, 112,
gal4-542, gal80-538, cyhr2, LYS2::GAL1UAS-GAL1TATA-HIS3,
URA3::GAL4 17mers(x3)-CYC1TATA-lacZ] selecting for tryptophan
prototrophy. The expression of the fusion protein was verified by Western
blot analysis with antibodies directed against the Flag epitope and the Gal4
DNA-BD. The recipient strain was then grown to mid-log phase and a human
liver cDNA library in the vector pACT2 (Clontech) introduced using the LiAc
procedure (Schiestl and Gietz, Curr. Genet. 16, 339-346, 1989). Transformants
were then selected for tryptophan, leucine and histidine prototrophy in the
presence of 5 mM 3-aminotriazole.

From a screen of 1x10^ clones, 39 colonies were initially selected on
synthetic complete (SC)-leu-his-trp +3AT medium and were then tested for
β-galactosidase activity. 12 clones scored positive in the latter assay and were
subjected to cycloheximide (CHX) curing to remove the bait plasmid by
streaking out on SC-leu media containing 10 µg/ml CHX (pAS2-1 contains the
CYH2 gene which restores CHX sensitivity to CG1945 cells). This enabled
confirmation of the bait dependency of LacZ activation and subsequent
isolation of the pACT2 plasmids encoding interacting proteins by standard
methodology (Philippsen et al, Methods in Enzymology 194, 170-177). Back
transformations were then performed in which these pACT2 plasmids were
introduced into CG1945 strains containing the bait plasmid (GRB14/pAS2-1)
or constructs encoding non-related Gal4 DNA-BD fusions in order to confirm
the specificity of the interactions.

The DNA sequences of the cDNA inserts were then obtained by cycle
sequencing (f-mol kit, Promega) using pACT2-specific and/or clone-specific
primers. Based on their nucleotide sequences the 12 interacting clones were
classified into 6 independent groups (see Table I).



TABLE I: Characterization of cDNA clones isolated by the yeast two
hybrid screen.

Class  No. of  Identity                               Mean RLU        Colour intensity
       clones                                         (Liquid assay)  (Filter assay)

1      6       Nedd4                                  2.86x10^        ++++
2      2       Htk                                    1.86x10^        ++
3      1       2.2412                                 5.18x10^        ++++
4      1       Proteosome                             3.88x10^        +/-
5      1       Somatostatin receptor                  1.45x10^        +/-
6      1       L-arginine:glycine amidinotransferase  8.61x10^        +/-



The 12 clones exhibiting activation of both the HIS3 and lacZ reporter genes were divided
into 6 groups by sequence analysis of their cDNA inserts. Results of β-galactosidase activity assays
performed using two methodologies are shown. The liquid culture-derived method (Galacto-Light,
TROPIX) is more quantitative: results are given in mean relative light units (RLU) and are normalized
for the protein content of the samples. Blue/white screening of the cDNA clones was also performed
using a colony lift filter assay (Clontech). The intensity of blue colour development over
approximately 2h is scored from +/- (very weak) to ++++ (strong).
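
The bookkeeping behind Table I can be checked with a short sketch: six classes accounting for twelve clones, ranked by the filter assay score. The RLU values are left out because their exponents are not legible in this copy, and the numeric weights given to the colour scores (+/- as 0.5, each "+" as 1) are an assumption made purely for sorting.

```python
# (class, number of clones, identity, filter assay score) from Table I.
TABLE_I = [
    (1, 6, "Nedd4", "++++"),
    (2, 2, "Htk", "++"),
    (3, 1, "2.2412", "++++"),
    (4, 1, "Proteosome", "+/-"),
    (5, 1, "Somatostatin receptor", "+/-"),
    (6, 1, "L-arginine:glycine amidinotransferase", "+/-"),
]


def filter_score(symbol: str) -> float:
    """Hypothetical numeric weight for a colour intensity symbol."""
    if symbol == "+/-":
        return 0.5
    return float(symbol.count("+"))


total_clones = sum(count for _, count, _, _ in TABLE_I)
class_count = len(TABLE_I)
best = max(filter_score(symbol) for _, _, _, symbol in TABLE_I)
strongest = [
    identity for _, _, identity, symbol in TABLE_I
    if filter_score(symbol) == best
]
```

The totals agree with the text: 12 positive clones fall into 6 groups, with Nedd4 and 2.2412 giving the strongest filter assay signal.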

Six clones were partial cDNAs corresponding to Nedd4, a multidomain
protein containing a calcium-dependent phospholipid binding (CaLB)
domain, four WW domains and a C-terminal region homologous to the E6-AP
carboxyl-terminus (Kumar et al, Biochem. Biophys. Res. Commun. 185,
1155-1161, 1992; Sudol et al, J. Biol. Chem. 270, 14733-14741, 1995; Huibregtse
et al, Proc. Natl. Acad. Sci. USA 92, 2563-2567, 1995). The latter is likely to confer
E3 ubiquitin-protein ligase activity on Nedd4. The pACT2 clones isolated
encoded the CaLB domain together with the first 22 amino acids of the first
WW domain.

Two clones encoded the intracellular region and part of the
extracellular domain of Htk, which is a RTK of the Eph family (Bennett et al,
J. Biol. Chem. 269, 14211-14218, 1994). The recruitment of Grb14 by Htk is of
interest for two reasons. First, the expression profiles of both Htk and the
murine homologue myk-1 are indicative of a potential role in mammary
gland development and neoplasia (Andres et al, Oncogene 9, 1461-1467, 1994;
Berclaz et al, Biochem. Biophys. Res. Comm. 226, 869-875, 1996). Second, Eph
family members may be involved in the regulation of cell migration
(Tessier-Lavigne, Cell 82, 345-348, 1995), which is intriguing given the
homology of the Grb7 family to the C. elegans protein mig10 (Stein et al,
1994 supra).

A novel cDNA of 1971 bp, designated 2.2412, was also isolated. This clone
encoded a polypeptide of 657 amino acids in frame with the Gal4 DNA-BD. The
cDNA did not contain a stop codon, and this, together with the Northern analysis
described below, indicated that it was incomplete. This DNA fragment was
therefore used as a probe to screen a human placental cDNA library (5' STRETCH
PLUS, Clontech, in λgt10). This resulted in the isolation of two clones, designated
clone 8 and clone 12. Clone 8 was approximately 2 kb and overlapped the original
2.2412 clone by 900 bp at the 3' end. This clone provided the carboxy-terminal
end of the 2.2412 protein sequence (Figure 1). Clone 12 was approximately 3.5 kb
and to date has provided an additional 692 bp of sequence information in the 5'
direction. The nucleotide and protein sequence for 2.2412 provided by these
overlapping clones is shown in Figure 1. Since a 5' initiation codon has yet to be
identified the coding sequence still appears to be incomplete.
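
The figures quoted for the original clone are mutually consistent, as the following arithmetic sketch shows. It uses the coordinates given in the description of Figure 1 (nucleotides 694-2664) and assumes 1-based, inclusive numbering; the helper names are hypothetical.

```python
def span_length(first: int, last: int) -> int:
    """Length in bp of a 1-based, inclusive nucleotide interval."""
    if last < first:
        raise ValueError("last must not be smaller than first")
    return last - first + 1


def codons_in(length_bp: int) -> int:
    """Number of complete codons in an open reading frame of this length."""
    return length_bp // 3


clone_length = span_length(694, 2664)   # original two hybrid clone
clone_codons = codons_in(clone_length)
in_frame = clone_length % 3 == 0
```

A span of 1971 bp with no stop codon corresponds to 657 codons, matching the 657 amino acids stated for the original clone.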

Further characterization of 2.2412

Database searches using the 2.2412 cDNA sequence revealed
significant homology with a large number of proteins containing ankyrin-like
repeats. These sequences were first identified as homologous regions
between certain cell cycle regulatory proteins and the Drosophila protein
Notch (Breeden and Nasmyth, Nature 329, 651-654, 1987) but subsequently
they have been identified in a wide variety of other proteins where they are
thought to function in protein-protein interactions (Bork, Proteins 17,
363-374, 1993). Subsequent analysis of the protein sequence identified 18
consecutive ankyrin repeats and an additional repetitive element (Figure 1).
The ankyrin repeat region is followed by a stretch of approximately 40 amino
acids rich in serine residues. The remaining C-terminal region has a
relatively high content of charged amino acids.



Northern analysis of 2.2412 mRNA expression

Northern blot analysis of multiple tissue northerns (Clontech) was
performed using the original 2.2412 cDNA as a probe. This resulted in the
detection of a single mRNA transcript of approximately 7 kb in all tissues
examined with the exception of the kidney. Expression was particularly high
in skeletal muscle and placenta. The size of this transcript compared to that
of the 2.2412 clone indicates that the latter represents only a partial cDNA.

Genomic localization of the 2.2412 gene

Fluorescence in situ hybridization of the original 2.2412 cDNA to
normal metaphases (Baker et al, 1996 supra) and reference to the FRA10A
fragile site at 10q23.32 localized the gene to between chromosome 10q23.2
and proximal 10q23.32. Interestingly, deletions in the 10q22-25 region of
chromosome 10 have been detected in a variety of human cancers including
breast, prostate, renal, small cell lung and endometrial carcinomas,
glioblastoma multiforme, melanoma and meningiomas, suggesting the
presence of one or more tumour suppressive loci in this region (Li et al,
Science 275, 1943-1947, 1997; Steck et al, Nature Genetics 15, 356-362, 1997,
and references therein). Two candidate tumour suppressor genes have been
identified in this region (MMAC1/PTEN and MXI1; Li et al, 1997 supra; Steck
et al, 1997 supra; Albarosa et al, Hum. Genet. 95, 709-711, 1995).


Analysis of the interaction between 2.2412 and Grb7 family members

cDNAs encoding the full length and N- and C-terminal regions of the
original 2.2412 cDNA clone (nucleotides 694-2664, 694-1614 and 1615-2664
of the sequence shown in Figure 1, respectively) were cloned into the vector
pGEX4T2 (Pharmacia). The full length construct was generated by
subcloning from the pACT2 clone as an NdeI fragment, whereas the shorter
constructs were synthesized by directional cloning of PCR products. The
corresponding GST-fusion proteins were purified from IPTG-induced
bacterial cultures using glutathione-agarose beads (Smith and Johnson, Gene
67, 31-40, 1988). These immobilized fusion proteins were then incubated
with lysates from cells expressing Flag epitope-tagged Grb14 (Daly et al, 1996
supra) or human breast cancer cells expressing high levels of Grb7 (SK-BR-3;
Stein et al, 1994) as described previously (Daly et al, 1996). Following
washing, bound proteins were detected by Western blot analysis. The results
indicated that 2.2412 bound specifically to both Grb14 and Grb7 in vitro, and
that the N-terminal fusion protein bound more strongly than that derived
from the C-terminus. These data, obtained using a different methodology for
detecting protein-protein interactions from the yeast two hybrid system, confirm
that 2.2412 interacts with Grb14. Furthermore, 2.2412 also binds Grb7.
Consequently 2.2412 appears to represent a general effector for the Grb7
family.

Mapping of the 2.2412 binding region on Grb14

In order to identify the region of Grb14 that interacts with 2.2412, a
series of Grb14 deletion mutants were generated by cloning PCR fragments
synthesized using the appropriate flanking primers into the vector pAS2.1.
These fragments spanned the following regions: N-terminus ("N", amino acids
1-110), the central region ("C") encompassing the mig10 homology and the
"between PH and SH2" (BPS) domain (amino acids 110-437) and the
N-terminal and central regions ("N + C", amino acids 1-437). These plasmids
were individually transformed into the yeast strain Y190 (MATa, ura3-52,
his3-200, ade2-101, lys2-801, trp1-901, leu2-3, 112, gal4Δ, gal80Δ, cyhr2,
LYS2::GAL1UAS-HIS3TATA-HIS3, URA3::GAL1UAS-GAL1TATA-lacZ) and
expression of the appropriately sized Gal4 DNA-BD fusion proteins
confirmed by Western blotting. Following transformation of the resulting
yeast strains with the original 2.2412 cDNA clone in pACT-2, the strength of
the interaction was determined by either liquid- or filter-based
β-galactosidase assays. The results are presented in Figure 2, and demonstrate
that the N-terminal region of Grb14 is not only required, but is also
sufficient, for binding 2.2412. This supports the hypothesis that 2.2412
represents a general effector for the Grb7 family, since the N-terminal region
of these proteins contains a highly conserved proline-rich motif which may
mediate this interaction.
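
The reasoning from deletion constructs to "required and sufficient" can be sketched as set logic. The binding outcomes below are not taken from Figure 2, which is not reproduced here; they are assumed from the stated conclusion (constructs containing the N-terminal region bind, the central region alone does not). The region label "SH2" for the part present only in the full length construct is likewise an assumption.

```python
# Regions carried by each Gal4 DNA-BD fusion construct.
CONSTRUCTS = {
    "FL": {"N", "C", "SH2"},
    "N": {"N"},
    "C": {"C"},
    "N+C": {"N", "C"},
}

# Assumed outcomes, inferred from the conclusion stated in the text.
BINDS = {"FL": True, "N": True, "C": False, "N+C": True}


def required_regions(constructs: dict, binds: dict) -> set:
    """Regions whose absence always abolishes binding."""
    all_regions = set().union(*constructs.values())
    required = set()
    for region in all_regions:
        lacking = [name for name, parts in constructs.items() if region not in parts]
        carrying_binders = [
            name for name, parts in constructs.items()
            if region in parts and binds[name]
        ]
        if carrying_binders and all(not binds[name] for name in lacking):
            required.add(region)
    return required


def sufficient_regions(constructs: dict, binds: dict) -> set:
    """Regions that bind when expressed on their own."""
    return {
        next(iter(parts))
        for name, parts in constructs.items()
        if len(parts) == 1 and binds[name]
    }


required = required_regions(CONSTRUCTS, BINDS)
sufficient = sufficient_regions(CONSTRUCTS, BINDS)
```

Under these assumed outcomes both sets contain only the N-terminal region, which is the conclusion drawn in the text.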

It will be appreciated by persons skilled in the art that numerous
variations and/or modifications may be made to the invention as shown in
the specific embodiments without departing from the spirit or scope of the
invention as broadly described. The present embodiments are, therefore, to
be considered in all respects as illustrative and not restrictive.

Sequence listings:

SEQUENCE LISTING 

Applicant: Garvan Institute of Medical Research 

Title of Invention: A potential effector for the Grb7 family of signalling
proteins

Current Application Number:
Current Filing Date:

Prior Application Number: PO9388

Prior Application Filing Date: 1997-09-23

Number of SEQ ID NOs: 2



Software: PatentIn Ver. 2.0 



SEQ ID NO: 1 
Length: 3400 
Type : DNA 

Organism: Homo sapiens 



Sequence: 1 

attcctcttc ataatgcatg ctcttttggt catgctgaag tagtcaatct ccttttgcga 60 

catggtgcag accccaatgc tcgagataat tggaattata ctcctctcca tgaagctgca 120 

attaaaggaa agattgatgt ttgcattgtg ctgttacagc atggagctga gccaaccatc 180 

cgaaatacag atggaaggac agcattggat ttagcagatc catctgccaa agcagtgctt 240 

actggtgaat ataagaaaga tgaactctta gaaagtgcca ggagtggcaa tgaagaaaaa 300 

atgatggctc tactcacacc attaaatgtc aactgccacg caagtgatgg cagaaagtca 360 

actccattac atttggcagc aggatataac agagtaaaga ttgtacagct gttactgcaa 420 

catggacgtg atgtccatgc taaagataaa ggtgatctgg taccattaca caatgcctgt 480 

tcttatggtc attatgaagt aactgaactt ttggtcaagc atggtggctg tgtaaatgca 540 

atggacttgt ggcaattcac tcctcttcat gaggcagctt ctaagaacag ggttgaagta 600 

tgttctcttc tcttaagtta tggtgcagac ccaacactgc tcaattgtaa gaataaaagt 660 

gctatagact tggctcccac accacagtta aaagaaagat tagcatatga atttaaaggc 720 

cactcgttgc tgcaagctgc acgagaagct gatgttactc gaatcaaaaa acatctctct 780 

ctggaaatgg tgaatttcaa gcatcctcaa acacatgaaa cagcattgca ttgtgctgct 840 

gcatctccat atcccaaaag aaagcaaata tgtgaactgt tgctaagaaa aggagcaaac 900 

atcaatgaaa agactaaaga attcttgact cctctgcacg tggcatctga gaaagctcat 960 

aatgatgttg ttgaagtagt ggtgaaacat gaagcaaagg ttaatgctct ggataatctt 1020 

ggtcagactt ctctacacag agctgcatat tgtggtcatc tacaaacctg ccgcctactc 1080 

ctgagctatg ggtgtgatcc taacattata tcccttcagg gctttactgc tttacagatg 1140 

ggaaatgaaa atgtacagca actcctccaa gagggtatct cattaggtaa ttcagaggca 1200 

gacagacaat tgctggaagc tgcaaaggct ggagatgtcg aaactgtaaa aaaactgtgt 1260 

actgttcaga gtgtcaactg cagagacatt gaagggcgtc agtctacacc acttcatttt 1320 

gcagctgggt ataacagagt gtccgtggtg gaatatctgc tacagcatgg agctgatgtg 1380 

catgctaaag ataaaggagg ccttgtacct ttgcacaatg catgttctta cggacattat 1440 

gaagttgcag aacttcttgt taaacatgga gcagtagtta atgtagctga tttatggaaa 1500 

tttacacctt tacatgaagc agcagcaaaa ggaaaatatg aaatttgcaa acttctgctc 1560 

cagcatggtg cagaccctac aaaaaaaaac agggatggaa atactccttt ggatcttgtt 1620 

aaagatggag atacagatat tcaagatctg cttaggggag atgcagcttt gctagatgct 1680 

gccaagaagg gttgtttagc cagagtgaag aagttgtctt ctcctgataa tgtaaattgc 1740 

cgcgataccc aaggcagaca ttcaacacct ttacatttag cagctggtta taataattta 1800 

gaagttgcag agtatttgtt acaacacgga gctgatgtga atgcccaaga caaaggagga 1860 

cttattcctt tacataatgc agcatcttac gggcatgtag atgtagcagc tctactaata 1920 

aagtataatg catctctcaa tgccacggac aaatgggctt tcacaccttt gcacgaagca 1980

gcccaaaagg gacgaacaca gctttgtgct ttgttgctag cccatggagc tgacccgact 2040

cttaaaaatc aggaaggaca aacaccttta gatttagttt cagcagatga tgtcagcgct 2100

cttctgacag cagccatgcc cccatctgct ctgccctctt gttacaagcc tcaagtgctc 2160

aatggtgtga gaagcccagg agccactgca gatgctctct cttcaggtcc atctagccca 2220

tcaagccttt ctgcagccag cagtcttgac aacttatctg ggagtttttc agaactgtct 2280

tcagtagtta gttcaagtgg aacagagggt gcttccagtt tggagaaaaa ggaggttcca 2340

ggagtagatt ttagcataac tcaattcgta aggaatcttg gacttgagca cctaatggat 2400

atatttgaga gagaacagat cactttggat gtattagttg agatggggca caaggagctg 2460

aaggagattg gaatcaatgc ttatggacat aggcacaaac taattaaagg agtcgagaga 2520

cttatctccg gacaacaagg tcttaaccca tatttaactt tgaacacctc tggtagtgga 2580

acaattctta tagatctgtc tcctgatgat aaagagtttc agtctgtgga ggaagagatg 2640

caaagtacag ttcgagagca cagagatgga ggtcatgcag gtggaatctt caacagatac 2700

aatattctca agattcagaa ggtttgtaac aagaaactat gggaaagata cactcaccgg 2760

agaaaagaag tttctgaaga aaaccacaac catgccaatg aacgaatgct atttcatggg 2820

tctccttttg tgaatgcaat tatccacaaa ggctttgatg aaaggcatgc gtacataggt 2880

ggtatgtttg gagctggcat ttattttgct gaaaactctt ccaaaagcaa tcaatatgta 2940

tatggaattg gaggaggtac tgggtgtcca gttcacaaag acagatcttg ttacatttgc 3000

cacaggcagc tgctcttttg ccgggtaacc ttgggaaagt ctttcctgca gttcagtgca 3060

atgaaaatgg cacattctcc tccaggtcat cactcagtca ctggtaggcc cagtgtaaat 3120

ggcctagcat tagctgaata tgttatttac agaggagaac aggcttatcc tgagtattta 3180

attacttacc agattatgag gcctgaaggt atggtcgatg gataaatagt tattttaaga 3240

aactaattcc actgaaccta aaatcatcaa agcagcagtg gcctctacgt tttactcctt 3300

tgctgaaaaa aaatcatctt gcccacaggc ctgtggcaaa aggataaaaa tgtgaacgaa 3360

gtttaacatt ctgacttgat aaagctttaa taatgtacag 3400



SEQ ID NO: 2 
Length: 1074 
Type: PRT 

Organism: Homo sapiens 
Sequence: 2 

Ile Pro Leu His Asn Ala Cys Ser Phe Gly His Ala Glu Val Val Asn
1               5                   10                  15

Leu Leu Leu Arg His Gly Ala Asp Pro Asn Ala Arg Asp Asn Trp Asn
            20                  25                  30

Tyr Thr Pro Leu His Glu Ala Ala Ile Lys Gly Lys Ile Asp Val Cys
        35                  40                  45

Ile Val Leu Leu Gln His Gly Ala Glu Pro Thr Ile Arg Asn Thr Asp
    50                  55                  60

Gly Arg Thr Ala Leu Asp Leu Ala Asp Pro Ser Ala Lys Ala Val Leu
65                  70                  75                  80

Thr Gly Glu Tyr Lys Lys Asp Glu Leu Leu Glu Ser Ala Arg Ser Gly
                85                  90                  95

Asn Glu Glu Lys Met Met Ala Leu Leu Thr Pro Leu Asn Val Asn Cys
            100                 105                 110

His Ala Ser Asp Gly Arg Lys Ser Thr Pro Leu His Leu Ala Ala Gly
        115                 120                 125

Tyr Asn Arg Val Lys Ile Val Gln Leu Leu Leu Gln His Gly Arg Asp
    130                 135                 140

Val His Ala Lys Asp Lys Gly Asp Leu Val Pro Leu His Asn Ala Cys
145                 150                 155                 160

Ser Tyr Gly His Tyr Glu Val Thr Glu Leu Leu Val Lys His Gly Gly
                165                 170                 175

Cys Val Asn Ala Met Asp Leu Trp Gln Phe Thr Pro Leu His Glu Ala
            180                 185                 190

Ala Ser Lys Asn Arg Val Glu Val Cys Ser Leu Leu Leu Ser Tyr Gly
        195                 200                 205

Ala Asp Pro Thr Leu Leu Asn Cys Lys Asn Lys Ser Ala Ile Asp Leu
    210                 215                 220

Ala Pro Thr Pro Gln Leu Lys Glu Arg Leu Ala Tyr Glu Phe Lys Gly
225                 230                 235                 240

His Ser Leu Leu Gln Ala Ala Arg Glu Ala Asp Val Thr Arg Ile Lys
                245                 250                 255

Lys His Leu Ser Leu Glu Met Val Asn Phe Lys His Pro Gln Thr His
            260                 265                 270

Glu Thr Ala Leu His Cys Ala Ala Ala Ser Pro Tyr Pro Lys Arg Lys
        275                 280                 285

Gln Ile Cys Glu Leu Leu Leu Arg Lys Gly Ala Asn Ile Asn Glu Lys
    290                 295                 300

Thr Lys Glu Phe Leu Thr Pro Leu His Val Ala Ser Glu Lys Ala His
305                 310                 315                 320

Asn Asp Val Val Glu Val Val Val Lys His Glu Ala Lys Val Asn Ala
                325                 330                 335

Leu Asp Asn Leu Gly Gln Thr Ser Leu His Arg Ala Ala Tyr Cys Gly
            340                 345                 350

His Leu Gln Thr Cys Arg Leu Leu Leu Ser Tyr Gly Cys Asp Pro Asn
        355                 360                 365

Ile Ile Ser Leu Gln Gly Phe Thr Ala Leu Gln Met Gly Asn Glu Asn
    370                 375                 380

Val Gln Gln Leu Leu Gln Glu Gly Ile Ser Leu Gly Asn Ser Glu Ala
385                 390                 395                 400

Asp Arg Gln Leu Leu Glu Ala Ala Lys Ala Gly Asp Val Glu Thr Val
                405                 410                 415

Lys Lys Leu Cys Thr Val Gln Ser Val Asn Cys Arg Asp Ile Glu Gly
            420                 425                 430

Arg Gln Ser Thr Pro Leu His Phe Ala Ala Gly Tyr Asn Arg Val Ser
        435                 440                 445

Val Val Glu Tyr Leu Leu Gln His Gly Ala Asp Val His Ala Lys Asp
    450                 455                 460

Lys Gly Gly Leu Val Pro Leu His Asn Ala Cys Ser Tyr Gly His Tyr
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465 

Glu Val Ala Glu 



Asp Leu Trp Lys 
500 

Tyr Glu lie Cys 
515 

Lys Asn Arg Asp 
530 

Thr Asp lie Gin 
545 

Ala Lys Lys Gly 



Asn Val Asn Cys 
580 

Leu Ala Ala Gly 
595 

His Gly Ala Asp 
610 

His Asn Ala Ala 
625 

Lys Tyr Asn Ala 



Leu His Glu Ala 
660 

Leu Ala His Gly 
675 

Pro Leu Asp Leu 
690 

Ala Met Pro Pro 
705 

Asn Gly Val Arg 



Pro Ser Ser Pro 
740 

Ser Gly Ser Phe 
755 



Glu Gly Ala Ser 
770 

Ser lie Thr Gin 



470 

Leu Leu Val Lys 
485 

Phe Thr Pro Leu 



Lys Leu Leu Leu 
520 

Gly Asn Thr Pro 
535 

Asp Leu Leu Arg 
550 

Cys Leu Ala Arg 
565 

Arg Asp Thr Gin 



Tyr Asn Asn Leu 
600 

Val Asn Ala Gin 
615 

Ser Tyr Gly His 
630 

Ser Leu Asn Ala 
645 

Ala Gin Lys Gly 



Ala Asp Pro Thr 
680 

Val Ser Ala Asp 
695 

Ser Ala Leu Pro 
710 

Ser Pro Gly Ala 
725 

Ser Ser Leu Ser 



Ser Glu Leu Ser 
760 

Ser Leu Glu Lys 
775 

Phe Val Arg Asn 



475 

His Gly Ala Val 
490 

His Glu Ala Ala 
505 

Gin His Gly Ala 



Leu Asp Leu Val 
540 

Gly Asp Ala Ala 
555 

Val Lys Lys Leu 
570 

Gly Arg His Ser 
585 

Glu Val Ala Glu 



Asp Lys Gly Gly 
620 

Val Asp Val Ala 
635 

Thr Asp Lys Trp 
650 

Arg Thr Gin Leu 
665 

Leu Lys Asn Gin 



Asp Val Ser Ala 
700 

Ser Cys Tyr Lys 
715 

Thr Ala Asp Ala 
730 

Ala Ala Ser Ser 
745 

Ser Val Val Ser 



Lys Glu Val Pro 
780 

Leu Gly Leu Glu 



480 

Val Asn Val Ala 
495 

Ala Lys Gly Lys 
510 

Asp Pro Thr Lys 
525 

Lys Asp Gly Asp 



Leu Leu Asp Ala 
560 

Ser Ser Pro Asp 
575 

Thr Pro Leu His 
590 

Tyr Leu Leu Gin 
605 

Leu lie Pro Leu 



Ala Leu Leu lie 
640 

Ala Phe Thr Pro 
655 

Cys Ala Leu Leu 
67 0 

Glu Gly Gin Thr 
685 

Leu Leu Thr Ala 



Pro Gin Val Leu 
720 

Leu Ser Ser Gly 
735 

Leu Asp Asn Leu 
750 

Ser Ser Gly Thr 
765 

Gly Val Asp Phe 



His Leu Met Asp 
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785 790 795 800 

Ile Phe Glu Arg Glu Gln Ile Thr Leu Asp Val Leu Val Glu Met Gly
805 810 815

His Lys Glu Leu Lys Glu Ile Gly Ile Asn Ala Tyr Gly His Arg His
820 825 830

Lys Leu Ile Lys Gly Val Glu Arg Leu Ile Ser Gly Gln Gln Gly Leu
835 840 845

Asn Pro Tyr Leu Thr Leu Asn Thr Ser Gly Ser Gly Thr Ile Leu Ile
850 855 860

Asp Leu Ser Pro Asp Asp Lys Glu Phe Gln Ser Val Glu Glu Glu Met
865 870 875 880

Gln Ser Thr Val Arg Glu His Arg Asp Gly Gly His Ala Gly Gly Ile
885 890 895

Phe Asn Arg Tyr Asn Ile Leu Lys Ile Gln Lys Val Cys Asn Lys Lys
900 905 910

Leu Trp Glu Arg Tyr Thr His Arg Arg Lys Glu Val Ser Glu Glu Asn
915 920 925

His Asn His Ala Asn Glu Arg Met Leu Phe His Gly Ser Pro Phe Val
930 935 940

Asn Ala Ile Ile His Lys Gly Phe Asp Glu Arg His Ala Tyr Ile Gly
945 950 955 960

Gly Met Phe Gly Ala Gly Ile Tyr Phe Ala Glu Asn Ser Ser Lys Ser
965 970 975

Asn Gln Tyr Val Tyr Gly Ile Gly Gly Gly Thr Gly Cys Pro Val His
980 985 990

Lys Asp Arg Ser Cys Tyr Ile Cys His Arg Gln Leu Leu Phe Cys Arg
995 1000 1005

Val Thr Leu Gly Lys Ser Phe Leu Gln Phe Ser Ala Met Lys Met Ala
1010 1015 1020

His Ser Pro Pro Gly His His Ser Val Thr Gly Arg Pro Ser Val Asn
1025 1030 1035 1040

Gly Leu Ala Leu Ala Glu Tyr Val Ile Tyr Arg Gly Glu Gln Ala Tyr
1045 1050 1055

Pro Glu Tyr Leu Ile Thr Tyr Gln Ile Met Arg Pro Glu Gly Met Val
1060 1065 1070

Asp Gly
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Claims: 

1. An isolated polynucleotide molecule encoding a candidate effector
protein for the Grb7 family of signalling proteins, wherein the polynucleotide
molecule comprises a nucleotide sequence having at least 75% sequence
identity to that shown as SEQ ID NO: 1.

2. A polynucleotide molecule according to claim 1, wherein the
polynucleotide molecule comprises a nucleotide sequence having at least
85% sequence identity to that shown as SEQ ID NO: 1.

3. A polynucleotide molecule according to claim 1, wherein the 
polynucleotide molecule comprises a nucleotide sequence having at least 
95% sequence identity to that shown as SEQ ID NO: 1. 

4. A polynucleotide molecule according to claim 1, wherein the 
polynucleotide molecule comprises a nucleotide sequence which 
substantially corresponds to that shown as SEQ ID NO: 1. 

5. A host cell transformed with a polynucleotide molecule according to
any one of the preceding claims.

6. A host cell according to claim 5, wherein the host cell is a mammalian, 
insect, yeast or bacterial host cell. 

7. A method of producing a protein, comprising culturing the host cell of 
claim 5 or 6 under conditions suitable for the expression of the 
polynucleotide molecule and optionally recovering the protein. 

8. A purified protein encoded by a polynucleotide molecule according to
any one of claims 1 to 4.

9. A purified protein according to claim 8, wherein the protein comprises
an amino acid sequence substantially corresponding to that shown as SEQ ID
NO: 2.

10. A fusion protein comprising an amino acid sequence substantially
corresponding to that shown as SEQ ID NO: 2.

11. An antibody or fragment thereof which specifically binds to a protein
according to claim 8 or 9.

12. An oligonucleotide probe comprising a nucleotide sequence of at least
12 nucleotides, the oligonucleotide probe comprising a nucleotide sequence
such that the oligonucleotide probe selectively hybridises to the
polynucleotide molecule of any one of claims 1 to 4 under high stringency
conditions.

13. An oligonucleotide probe according to claim 12, wherein the
oligonucleotide probe comprises a nucleotide sequence of at least 18
nucleotides.

14. A method of detecting in a sample the presence of an effector protein 
for the Grb7 family of proteins, the method comprising reacting the sample 
with an antibody or fragment thereof according to claim 11. 

15. A method of detecting in a sample the presence of mRNA encoding an 
effector protein for the Grb7 family of proteins, the method comprising 
reacting the sample with an oligonucleotide probe of claim 12 or 13. 
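
Claims 1 to 3 are framed in terms of percent sequence identity to SEQ ID NO: 1. The claims do not state how identity is to be calculated, so the following Python sketch is only an illustration under stated assumptions: the two sequences are taken to be already aligned to equal length (with '-' marking gaps), and identity is counted over the full alignment length. The function name percent_identity and the ten-base example fragments are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns holding the same base in both sequences.

    Assumption (not from the specification): inputs are pre-aligned,
    equal-length strings, and gap columns ('-') count as mismatches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


# First ten bases of Figure 1 against a hypothetical variant with one mismatch.
reference = "ATTCCTCTTC"
variant = "ATTCCTCATC"
identity = percent_identity(reference, variant)
print(f"{identity:.1f}% identity")
print("meets 75% threshold:", identity >= 75.0)
```

Different alignment methods and different denominators (for example, excluding gap columns) give different percentages, so a real comparison would need to fix those choices first.
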
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FIGURE 1 

ATTCCTCTTCATAATGCATGCTCTTTTGGTCATGCTGAAGTAGTCAATCTCCTTTTGCGACATGGTGCAG 70
IPLHNACSFGHAEVVNLLLRHGA

ACCCCAATGCTCGAGATAATTGGAATTATACTCCTCTCCATGAAGCTGCAATTAAAGGAAAGATTGATGT 140
DPNARDNWNYTPLHEAAIKGKIDV

TTGCATTGTGCTGTTACAGCATGGAGCTGAGCCAACCATCCGAAATACAGATGGAAGGACAGCATTGGAT 210
CIVLLQHGAEPTIRNTDGRTALD

TTAGCAGATCCATCTGCCAAAGCAGTGCTTACTGGTGAATATAAGAAAGATGAACTCTTAGAAAGTGCCA 280
LADPSAKAVLTGEYKKDELLESA

GGAGTGGCAATGAAGAAAAAATGATGGCTCTACTCACACCATTAAATGTCAACTGCCACGCAAGTGATGG 350
RSGNEEKMMALLTPLNVNCHASDG

CAGAAAGTCAACTCCATTACATTTGGCAGCAGGATATAACAGAGTAAAGATTGTACAGCTGTTACTGCAA 420
RKSTPLHLAAGYNRVKIVQLLLQ

CATGGACGTGATGTCCATGCTAAAGATAAAGGTGATCTGGTACCATTACACAATGCCTGTTCTTATGGTC 490
HGRDVHAKDKGDLVPLHNACSYG

ATTATGAAGTAACTGAACTTTTGGTCAAGCATGGTGGCTGTGTAAATGCAATGGACTTGTGGCAATTCAC 560
HYEVTELLVKHGGCVNAMDLWQFT

TCCTCTTCATGAGGCAGCTTCTAAGAACAGGGTTGAAGTATGTTCTCTTCTCTTAAGTTATGGTGCAGAC 630
PLHEAASKNRVEVCSLLLSYGAD

CCAACACTGCTCAATTGTAAGAATAAAAGTGCTATAGACTTGGCTCCCACACCACAGTTAAAAGAAAGAT 700
PTLLNCKNKSAIDLAPTPQLKER

TAGCATATGAATTTAAAGGCCACTCGTTGCTGCAAGCTGCACGAGAAGCTGATGTTACTCGAATCAAAAA 770
LAYEFKGHSLLQAAREADVTRIKK

ACATCTCTCTCTGGAAATGGTGAATTTCAAGCATCCTCAAACACATGAAACAGCATTGCATTGTGCTGCT 840
HLSLEMVNFKHPQTHETALHCAA

GCATCTCCATATCCCAAAAGAAAGCAAATATGTGAACTGTTGCTAAGAAAAGGAGCAAACATCAATGAAA 910
ASPYPKRKQICELLLRKGANINE

AGACTAAAGAATTCTTGACTCCTCTGCACGTGGCATCTGAGAAAGCTCATAATGATGTTGTTGAAGTAGT 980
KTKEFLTPLHVASEKAHNDVVEVV

GGTGAAACATGAAGCAAAGGTTAATGCTCTGGATAATCTTGGTCAGACTTCTCTACACAGAGCTGCATAT 1050
VKHEAKVNALDNLGQTSLHRAAY

TGTGGTCATCTACAAACCTGCCGCCTACTCCTGAGCTATGGGTGTGATCCTAACATTATATCCCTTCAGG 1120
CGHLQTCRLLLSYGCDPNIISLQ

GCTTTACTGCTTTACAGATGGGAAATGAAAATGTACAGCAACTCCTCCAAGAGGGTATCTCATTAGGTAA 1190
GFTALQMGNENVQQLLQEGISLGN

TTCAGAGGCAGACAGACAATTGCTGGAAGCTGCAAAGGCTGGAGATGTCGAAACTGTAAAAAAACTGTGT 1260
SEADRQLLEAAKAGDVETVKKLC

ACTGTTCAGAGTGTCAACTGCAGAGACATTGAAGGGCGTCAGTCTACACCACTTCATTTTGCAGCTGGGT 1330
TVQSVNCRDIEGRQSTPLHFAAG

ATAACAGAGTGTCCGTGGTGGAATATCTGCTACAGCATGGAGCTGATGTGCATGCTAAAGATAAAGGAGG 1400
YNRVSVVEYLLQHGADVHAKDKGG

CCTTGTACCTTTGCACAATGCATGTTCTTACGGACATTATGAAGTTGCAGAACTTCTTGTTAAACATGGA 1470
LVPLHNACSYGHYEVAELLVKHG

GCAGTAGTTAATGTAGCTGATTTATGGAAATTTACACCTTTACATGAAGCAGCAGCAAAAGGAAAATATG 1540
AVVNVADLWKFTPLHEAAAKGKY

AAATTTGCAAACTTCTGCTCCAGCATGGTGCAGACCCTACAAAAAAAAACAGGGATGGAAATACTCCTTT 1610
EICKLLLQHGADPTKKNRDGNTPL

GGATCTTGTTAAAGATGGAGATACAGATATTCAAGATCTGCTTAGGGGAGATGCAGCTTTGCTAGATGCT 1680
DLVKDGDTDIQDLLRGDAALLDA

GCCAAGAAGGGTTGTTTAGCCAGAGTGAAGAAGTTGTCTTCTCCTGATAATGTAAATTGCCGCGATACCC 1750
AKKGCLARVKKLSSPDNVNCRDT

AAGGCAGACATTCAACACCTTTACATTTAGCAGCTGGTTATAATAATTTAGAAGTTGCAGAGTATTTGTT 1820
QGRHSTPLHLAAGYNNLEVAEYLL

ACAACACGGAGCTGATGTGAATGCCCAAGACAAAGGAGGACTTATTCCTTTACATAATGCAGCATCTTAC 1890
QHGADVNAQDKGGLIPLHNAASY

GGGCATGTAGATGTAGCAGCTCTACTAATAAAGTATAATGCATCTCTCAATGCCACGGACAAATGGGCTT 1960
GHVDVAALLIKYNASLNATDKWA

TCACACCTTTGCACGAAGCAGCCCAAAAGGGACGAACACAGCTTTGTGCTTTGTTGCTAGCCCATGGAGC 2030
FTPLHEAAQKGRTQLCALLLAHGA

TGACCCGACTCTTAAAAATCAGGAAGGACAAACACCTTTAGATTTAGTTTCAGCAGATGATGTCAGCGCT 2100
DPTLKNQEGQTPLDLVSADDVSA

CTTCTGACAGCAGCCATGCCCCCATCTGCTCTGCCCTCTTGTTACAAGCCTCAAGTGCTCAATGGTGTGA 2170
LLTAAMPPSALPSCYKPQVLNGV

GAAGCCCAGGAGCCACTGCAGATGCTCTCTCTTCAGGTCCATCTAGCCCATCAAGCCTTTCTGCAGCCAG 2240
RSPGATADALSSGPSSPSSLSAAS

CAGTCTTGACAACTTATCTGGGAGTTTTTCAGAACTGTCTTCAGTAGTTAGTTCAAGTGGAACAGAGGGT 2310
SLDNLSGSFSELSSVVSSSGTEG

GCTTCCAGTTTGGAGAAAAAGGAGGTTCCAGGAGTAGATTTTAGCATAACTCAATTCGTAAGGAATCTTG 2380
ASSLEKKEVPGVDFSITQFVRNL

GACTTGAGCACCTAATGGATATATTTGAGAGAGAACAGATCACTTTGGATGTATTAGTTGAGATGGGGCA 2450
GLEHLMDIFEREQITLDVLVEMGH

CAAGGAGCTGAAGGAGATTGGAATCAATGCTTATGGACATAGGCACAAACTAATTAAAGGAGTCGAGAGA 2520
KELKEIGINAYGHRHKLIKGVER

CTTATCTCCGGACAACAAGGTCTTAACCCATATTTAACTTTGAACACCTCTGGTAGTGGAACAATTCTTA 2590
LISGQQGLNPYLTLNTSGSGTIL

TAGATCTGTCTCCTGATGATAAAGAGTTTCAGTCTGTGGAGGAAGAGATGCAAAGTACAGTTCGAGAGCA 2660
IDLSPDDKEFQSVEEEMQSTVREH

CAGAGATGGAGGTCATGCAGGTGGAATCTTCAACAGATACAATATTCTCAAGATTCAGAAGGTTTGTAAC 2730
RDGGHAGGIFNRYNILKIQKVCN

AAGAAACTATGGGAAAGATACACTCACCGGAGAAAAGAAGTTTCTGAAGAAAACCACAACCATGCCAATG 2800
KKLWERYTHRRKEVSEENHNHAN

AACGAATGCTATTTCATGGGTCTCCTTTTGTGAATGCAATTATCCACAAAGGCTTTGATGAAAGGCATGC 2870
ERMLFHGSPFVNAIIHKGFDERHA

GTACATAGGTGGTATGTTTGGAGCTGGCATTTATTTTGCTGAAAACTCTTCCAAAAGCAATCAATATGTA 2940
YIGGMFGAGIYFAENSSKSNQYV

TATGGAATTGGAGGAGGTACTGGGTGTCCAGTTCACAAAGACAGATCTTGTTACATTTGCCACAGGCAGC 3010
YGIGGGTGCPVHKDRSCYICHRQ

TGCTCTTTTGCCGGGTAACCTTGGGAAAGTCTTTCCTGCAGTTCAGTGCAATGAAAATGGCACATTCTCC 3080
LLFCRVTLGKSFLQFSAMKMAHSP

TCCAGGTCATCACTCAGTCACTGGTAGGCCCAGTGTAAATGGCCTAGCATTAGCTGAATATGTTATTTAC 3150
PGHHSVTGRPSVNGLALAEYVIY

AGAGGAGAACAGGCTTATCCTGAGTATTTAATTACTTACCAGATTATGAGGCCTGAAGGTATGGTCGATG 3220
RGEQAYPEYLITYQIMRPEGMVD

GATAAATAGTTATTTTAAGAAACTAATTCCACTGAACCTAAAATCATCAAAGCAGCAGTGGCCTCTACGT 3290
TTTACTCCTTTGCTGAAAAAAAATCATCTTGCCCACAGGCCTGTGGCAAAAGGATAAAAATGTGAACGAA 3360
GTTTAACATTCTGACTTGATAAAGCTTTAATAATGTACAG
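
The one-letter amino acid line beneath each nucleotide line of Figure 1 can be reproduced with the standard genetic code. The Python sketch below is illustrative only: the helper name translate and the compact codon-table construction are assumptions of this example, not part of the specification. It translates the first line of the figure, whose first 69 nucleotides form 23 complete codons.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate complete codons from position 0; trailing bases are ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of Figure 1 (nucleotides 1-70); the 70th base starts the next codon.
first_line = "ATTCCTCTTCATAATGCATGCTCTTTTGGTCATGCTGAAGTAGTCAATCTCCTTTTGCGACATGGTGCAG"
print(translate(first_line))
```

The printed string, IPLHNACSFGHAEVVNLLLRHGA, matches the first translation line of the figure and the first 23 residues of SEQ ID NO: 2.
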
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